Using Concept Maps in NDLTD as a Cross-Language Summarization Tool for Computing-Related ETDs
Abstract
Concept maps, introduced by Novak, aid learners’ understanding. We hypothesize that concept maps can also serve as summaries of large documents such as electronic theses and dissertations (ETDs). Our system automatically generates concept maps from English-language ETDs in the computing field and will also provide Spanish translations of these maps for native Spanish speakers. Based on the results of our enhanced machine translation techniques, we believe concept maps could allow researchers to discover pertinent dissertations in languages they cannot read, helping them decide whether to have a potentially relevant dissertation translated. We use a state-of-the-art natural language processing system called Relex to extract noun phrases and noun-verb-noun relations from ETDs, and then produce concept maps from them automatically. We have also incorporated information from the tables of contents of ETDs to create novel styles of concept maps. We are currently producing concept maps for the Virginia Tech CS collection (175 ETDs), which covers a broad range of computer science topics, and we intend to extend automatic production to computing-related ETDs across a larger segment of the NDLTD holdings. We recently conducted two user studies to evaluate user perceptions of these different map styles. We use several methods to translate node and link text in concept maps from English to Spanish. Nodes labeled with single words from a given technical area can be translated using word lists, but phrases in specific technical fields can be difficult to translate. We have therefore amassed a collection of about 580 Spanish-language ETDs, from Scirus and two Mexican universities, and we are using this corpus to mine phrase translations that we could not find otherwise. We have also tested the usefulness of the automatically generated and translated concept maps in a user experiment conducted at Universidad de las Americas (UDLA) in Puebla, Mexico. This experiment provides insight into whether concept maps can augment abstracts (translated using a standard machine translation package) in helping Spanish-speaking users find ETDs of interest.
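The pipeline the abstract describes (noun-verb-noun relations become nodes and links, and labels are then translated with a word list) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the triples, the word list, and the function names are hypothetical, and the sketch does not reproduce Relex output or the authors' actual implementation.

```python
from collections import defaultdict

# Hypothetical noun-verb-noun triples. The real system extracts these with
# Relex, whose output format is not given in the abstract.
triples = [
    ("concept map", "summarizes", "dissertation"),
    ("system", "generates", "concept map"),
    ("system", "translates", "concept map"),
]

# Hypothetical English-to-Spanish word list for node and link labels.
word_list = {
    "concept map": "mapa conceptual",
    "dissertation": "tesis",
    "system": "sistema",
    "generates": "genera",
    "summarizes": "resume",
}


def build_concept_map(triples):
    """Group triples into a map: source node -> list of (link label, target node)."""
    graph = defaultdict(list)
    for source, link, target in triples:
        graph[source].append((link, target))
    return dict(graph)


def translate_label(label, word_list):
    """Return the listed translation, or the original label when none is known."""
    return word_list.get(label, label)


def translate_concept_map(graph, word_list):
    """Translate every node and link label, leaving unknown labels unchanged."""
    return {
        translate_label(source, word_list): [
            (translate_label(link, word_list), translate_label(target, word_list))
            for link, target in edges
        ]
        for source, edges in graph.items()
    }


english_map = build_concept_map(triples)
spanish_map = translate_concept_map(english_map, word_list)
```

In this toy data the link label "translates" has no entry in the word list, so it stays in English. That is the kind of gap the abstract says the Spanish-language ETD corpus is meant to fill by mining phrase translations.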
Related resources
Automatic Creation and Translation of Concept Maps for Computer Science-related Theses and Dissertations
Concept maps are often used as tools to enhance student learning on new topics. We hypothesize that they also can be used as summarization tools for large documents. We hypothesize further that, because concept maps usually have short text sections (words and short phrases), it should be easier to translate concept maps than parts of documents (such as abstracts), since the text need not be nat...
An OAI-Based Filtering Service for CITIDEL from NDLTD
One goal of the Computing and Information Technology Interactive Digital Educational Library (CITIDEL) is to maximize the number of computing-related resources available to computer science scholars and practitioners through it. In this paper, we describe a set of experiments designed to help this goal by adding to CITIDEL a sub-collection of computing related electronic theses and dissertation...
Topical Categorization of Large Collections of Electronic Theses and Dissertations
Electronic Theses and Dissertations (ETDs) form an important part of scholarly work. Many universities in the USA, and other parts of the world, require their students to submit their theses and dissertations in electronic form. The ETDs are hosted by the respective universities, and no single point of access exists to the different ETD collections. Various initiatives like NDLTD have aimed to ...
The Evolving Genre of Electronic Theses and Dissertations
Electronic theses and dissertations (ETDs) are a unique genre that is emerging in part as a result of the work to build the Networked Digital Library of Theses and Dissertations (NDLTD). Virginia Tech began requiring ETDs January 1, 1997 and has since received over 1450. Quality has already improved and what has been learned is more broadly shared now due to the national and international inter...
Networked Digital Library of Theses and Dissertations: Bridging the Gaps for Global Access - Part 2: Services and Research
The Networked Digital Library of Theses and Dissertations (NDLTD) is a collaborative effort of universities around the world to promote creating, archiving, distributing and accessing Electronic Theses and Dissertations (ETDs). Since its inception in 1996, over a hundred universities have joined the initiative, underscoring the importance institutions place on training their graduates in the em...
Publication date: 2007